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Methods and apparatus for binarisinq images 
Field of the invention 

The present invention relates to methods for processing an image so as to 
classify pixels of the image based on an intensity threshold. In particular, the 
5 invention relates to such a method having an improved process for selection 
of the threshold. The invention is applicable to both medical and non-medical 
images. 

Background of Invention 

10 Binarisation is a well-known technique for image segmentation - that is 
classifying pixels of the image into two classes. Binarisation performs this 
classification based on whether a given pixel of the image has an intensity 
(gray-level) above or below a threshold. Binarisation has been widely applied 
to a number of image processing and computer vision applications, as a 

15 preliminary segmentation step. It makes an implicit assumption that an object 
of interest in the image has different intensity values from other (background) 
portions of the image. 

Many techniques exist for selection of the threshold. For example, in some 
20 such processes, the threshold can be selected in a process involving user 
interaction, while in other processes the threshold is selected entirely 
automatically. In some such processes the threshold is selected locally (i.e. 
such that the threshold varies from one pixel to another), while in other 
processes the threshold is the same over the whole image. 

25 

Most automatic threshold selection methods employ a histogram of the gray 
levels in the image. For example, Otsu [1] proposed a selection of the 
threshold to maximise the separability of the resultant classes in gray levels, 
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which is performed by minimising the within-class variance. Li and Lee [2] 
selected the threshold by minimising the cross entropy between the image 
and its segmented version. Kittler and lllingworth [3] selected the threshold by 
minimising the Bayes errors under the assumption that the object and pixel 
5 gray level values are normally distributed. Kapur et al [4] provided a maximum 
entropy approach. Wong and Sahoo [5] maximised the entropy with 
constraints on the region homogeneity and object boundary. Saha and Udupa 
[6] proposed a technique which maximised class uncertainty and homogeneity 
of the regions. Cheng et al [7] used the concept of fuzzy c-partition and the 
10 maximum fuzzy entropy principle to select a threshold. 

Cheung at al (US5,231,580A, 1993) disclosed an automatic method to 
characterise nerve fibres using local thresholds. It first partitions the entire 
image into sub-images and finds the threshold for each sub-image using a 
15 histogram-based thresholding method. Then, the pixel-wise threshold is 
approximated by interpolating the thresholds of neighbouring subimages. 

Summary of the Invention 

It is observed that the existing methods for selecting a threshold described 
20 above lack a mechanism for incorporating prior knowledge about the images 
to be binarised. 

Thus, the present invention aims to provide a new and useful technique for 
selecting a threshold for binarising an image, and in particular one which 
enables prior knowledge to be explicitly incorporated. 

25 In general terms, the invention proposes firstly that this prior knowledge is 
used to define a region of interest (ROI) in the image, such that the analysis of 
frequency distribution of pixel intensities (represented by a frequency 
histogram) is performed only for pixels in the ROI. Secondly, the invention 



WO 2005/057493 



PCT/SG2004/000403 



3 

proposes that the prior knowledge is used to select an intensity range, and 
that only pixels within this intensity range are used to generate the frequency 
distribution from which the threshold is selected. 

These two ideas are in principle separate, but in combination they provide a 
5 highly effective mechanism for incorporating prior knowledge into the 
threshold selection. The advantage is critical whether the image is a medical 
one or not. In particular, a threshold can be found to binarise images which 
exhibits high robustness to imaging artefacts such as gray level 
inhomogeneity and noise. 

10 Specifically, one expression of the invention is a method of binarising an 
image composed of pixels having respective intensity values, the method 
comprising: 

(i) using prior knowledge about the image to derive a region of interest 
within it- 
IS (ii) using prior knowledge about the image to derive an intensity range 
of pixels in the said region of interest; 

(iii) obtaining a frequency distribution of the intensities within the said 
intensity range of pixels within the said region of interest; 

(iv) using the said frequency distribution to derive an intensity 
20 threshold; and 

(v) binarising the image by classifying pixels in the said region of 
interest according to whether their intensities are above or below the said 
intensity threshold. 
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The invention may alternatively be expressed as a computer system which is 
set up to perform such a method. Alternatively, it can be expressed as 
software for performing the method. 

Brief Description of The Figures 

5 Preferred features of the invention will now be described, for the sake of 
illustration only, with reference to the following figures in which: 

Fig. 1 shows the steps in a method which is an embodiment of the 
invention; 

Fig. 2 shows an MR SPGR intercommissural axial slice of a brain, 
10 which is a suitable subject for the method of Fig. 1 ; 

Fig. 3 shows a region of interest within the image of Fig. 2 derived by a 
first step of the method of Fig. 1 ; 

Fig. 4 is a gray-level histogram of the ROl shown in Fig. 3, and a 
threshold selected in one form of a step of the method of Fig. 1; and 
15 Fig. 5 shows the binarised image using the threshold selected in the 

method of Fig. 1. 

Detailed Description of the embodiments 

20 

Referring firstly to Fig. 1, the overall steps of a method which is an 
embodiment of the invention are shown. 

In step 1, an image is input. 

25 

In step 2, prior knowledge of the image is used to define a region of interest 
(ROl) which is a subset of the image. This process can be done by whatever 
means, either automatic, semi-automatic, or even manual. 
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In step 3 an analysis is performed on the frequency of occurrence of 
intensities within the ROI, and a range of frequencies is defined, again using 
prior knowledge. 

5 For example, without losing generality, we denote the image to be processed 
as f(x), where f(x) is the gray level at a pixel labelled x. It is further supposed 
that the processed image has L gray levels denoted by r ; . where / is an integer 

in the range 0 to L-1 and r 0 <r, <...r M . It is also assumed that the object of 
interest has higher intensity values than the background. Suppose that due to 
10 prior knowledge or test we know that the proportion of the region of interest 
which is occupied by the object is in the percentage range per 0 to per?. 

Let h(i) denote the frequency of gray level r it and let H(i) denote the 
cumulative frequency which is /*(/')» where /' is an integer dummy index. 

i'=0 

15 Considering two values of / written as m and n, the frequency of intensities in 

n 

the range r m to r n is ^h(i % ) . Thus, we can use per 0 \o calculate a gray level 

How , such that we are sure that all the pixels having lower intensity represent 
background. n ow can be written as: 

20 r low = min{/ 1 HQ) > per,}. (1) 

Similarly, we can use pen to calculate a gray level r high such that we are sure 
that all pixels having higher intensity represent the object: 

r high =rmn{i\H(i)>per { }. (2) 

25 In a step 4 of the method of Fig 1, the threshold is selected using an algorithm 
which operates on the frequencies within the selected range from n ow to r^. 
The details of several ways in which this can be carried out within the scope 
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of the invention are given below. Thus, a selected threshold is output in step 
5. 

Image binarisation is then performed using this threshold, to create an image 
5 in which all pixels (at least in the ROI) are classified into two classes. Further 
image processing steps may optionally be performed at this stage. 

We now turn to a discussion of three techniques by which step 4 can be 
carried out. 

10 

1. Range-constrained least valley detection method (RCLVD) 

If the frequency range derived in step 3 is correctly estimated then it will 
include a valley in the frequency distribution of intensities. This valley 
15 separates the background and the object. Thus, valley detection can be 
exploited to select the threshold. This has the following steps: 

1) A frequency interval Sh is specified. 

2) The gray level range [r l4W ,r Aft J is partitioned into K+1 intervals with an equal 
20 frequency ranged. For an interval labelled by integer index j % the lower end 

of its intensity range is denoted r/ and the upper end is denoted r{ . Thus: 
r x = r low , r 2 ° = min{/ 1 H(i) > (per 0 + Sh)} , 

r} =r 2 °, r\ = Tmn{i\HQ)>(H(r x l ) + Sh)}, 

25 r* - r 2 K ~ l , r 2 K = minji | H(i) > (H(r^ ) + Sh) . 
H{r* +Sh)> per x and H(r* ) < per x . 

3) The average frequency h J for each of the intervals j is calculated given by 
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=(H{r 2 J )-H{r x J ))/(r 2 J -r x J ) 



4) Let J denote the interval for which h J is a minimum. The threshold of this 
RCVLD method, which is denoted 0 RCVLD , may be selected to be any value in 



5 

2. Range-constrained weighted variance method (RCWV) 

Let r k fall within the range r JOW to r^, and suppose that the pixels of the ROI 
are in two classes C? and C 2i where C* is pixels of the background class and 
10 consists of pixels with gray levels r iow to r k , and C 2 is pixels of the object class 
and is composed of pixels with gray levels r k +1 to r high . The range-constrained 
weighted variance method maximises the "weighted between-class variance" 
defined as: 



15 where Wi and W 2 are two positive constants selected by the user and 
representing the weights of the two respective class variances, Pr(.) denotes 
the class probability, i.e. 



the range r{ 



X J to r( , such as 0 RCVLD = (r/ + r x J )/2. 




and D(Ci) and DfC^) are given by: 



20 D(C X ) = (A) -/i r ) 2 and D(C 2 ) = {ju x - ju T Y . where M r = £ /x *(0 * 




When Wi is bigger than W 2 , background homogeneity is emphasised. 



3. 



Range-constrained fuzzy c-partition thresholding method (RCFCP) 



25 
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This third method is related to the technique used in [7], and the justification 
for it is as given there. In general terms, let A b /A 0 be the fuzzy sets of fuzzy 
events "background/object" (which denotes a fuzzy partition of the set 
{r iow ,...,r high with a membership function ju Ab ///^respectively). The probability 
of these fuzzy events are given by: 



H A i) = (j)* h j • where A, e {A b9 A Q } , and the weighted entropy with this 

fuzzy partition can be calculated as: 

S(W } ,W 2 ) = W x x P(A b ) x log P(A b ) + W 2 x P(A 0 ) x log P(A 0 ) 

where Wi and W 2 are two positive constants, and log(.) is the natural 
10 logarithm. 

Let r IOW <a<c< r high . The membership functions can be defined as follows: 



Ma» O) = 
and 



U r low <x<a 

(x — c) I (a - c) a < x < c 

0 c < x <: 



1. ^ * ^ A 

(x-a)l(c-d) a<x<c 



0 c<^^ 
15 The optimum parameters a* and c* are chosen to maximise the entropy 
S(W/,Wy, and the optimum threshold is 9 RCFCP - {a +c)l2. 

Having now presented the steps of the embodiment in principle, we turn to an 
example of the embodiment in operation. This example uses the form of step 
20 4 referred to above as RCLVD. 



The starting point of the method is the image shown in Fig. 2, an MR 
(Magnetic Resonance) image which is a T1 -weighted or SPGR (spoiled 
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gradient recalled acquisition) axial slice around the intercommissural plane. 
This image is input in step 1 of the method. 

In step 2 of the method, we calculate the pixels enclosed by the skull (i.e. find 
5 the ROI) using the following steps: the usual histogram-based thresholding 
method is used to binarise the axial slice; a morphological closing operation is 
used to connect small gaps; the largest connected component is identified; 
and the holes within the component are filled. The resulting ROI (the pixels 
enclosed by the skull) is shown in Fig. 3. 

10 

In step 3, the two percentages per 0 and pen are set as 14% and 28%. This 
selection is based on previous experiments and/or other prior knowledge. 

In step 4 of the method (RCLVD), we select the<5foto be 1% (alternatively any 
15 value in the range 1% to 5% would be suitable). Fig. 4 shows the histogram of 
frequencies in the ROI, and the calculated threshold 0 RCLVD is shown as the 
line indicated. This completes the procedure of the embodiment. 

The output threshold of the method is used as in conventional techniques to 
20 binarise the image. The binarised image is shown in Fig. 5. 

Although only a single embodiment of the invention has been described, 
many variations are possible within the scope of the invention as will be clear 
to a skilled reader. 

25 
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